Semantics and Discourse Processing for Expressive TTS
Abstract
In this paper we present ongoing work on an expressive TTS reader that can be used in both text and dialogue applications. The system was previously used to read (English) poetry and has now been extended to short stories. The text is fully analyzed at the phonetic and phonological levels as well as at the syntactic and semantic levels. The core of the system is the Prosodic Manager, which takes discourse structures and relations as input and uses this information to modify the TTS parameters accordingly. The text is transformed into a poem-like structure in which each line corresponds to a Breath Group that is semantically and syntactically consistent, and stanzas correspond to paragraph boundaries. Analogical parameters are related to ToBI theoretical indices, but their number is doubled.
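The poem-like layout described in the abstract can be illustrated with a minimal sketch. It is not the system's method: the abstract says Breath Groups come from syntactic and semantic analysis, whereas this example assumes punctuation is an acceptable stand-in for those boundaries. The function name `to_poem_structure` is hypothetical.

```python
import re
from typing import List


def to_poem_structure(text: str) -> List[List[str]]:
    """Return stanzas (one per paragraph), each a list of lines.

    Assumption: each line approximates a Breath Group by splitting
    after punctuation. The system in the abstract derives these
    boundaries from syntactic and semantic analysis instead.
    """
    stanzas = []
    # Paragraphs are separated by one or more blank lines.
    for paragraph in re.split(r"\n\s*\n", text.strip()):
        # Split on whitespace that follows a punctuation mark,
        # keeping the mark attached to the preceding line.
        pieces = re.split(r"(?<=[,;:.!?])\s+", paragraph.strip())
        lines = [piece.strip() for piece in pieces if piece.strip()]
        if lines:
            stanzas.append(lines)
    return stanzas


if __name__ == "__main__":
    story = (
        "The sun set, and the birds fell silent. Night came.\n\n"
        "She waited; no one answered."
    )
    for stanza in to_poem_structure(story):
        print("\n".join(stanza))
        print()
```

A downstream component such as the Prosodic Manager would then attach prosodic parameters to each line. That step depends on the discourse analysis and is not sketched here.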
Similar resources
SPARSAR: An Expressive Poetry Reader
We present SPARSAR, a system for the automatic analysis of poetry (and text) style which makes use of NLP tools like tokenizers, sentence splitters, NER (Named Entity Recognition) tools, and taggers. In addition, the system adds syntactic and semantic structural analysis and prosodic modeling. We do a dependency mapping to analyse the verbal complex and determine Discourse Structure. Another impor...
A Discourse Analysis of “The Prince and His Companions” in Kelileh and Demneh Based On Semio-Semantics
Despite showing an overtly simple structure, the semantic process in classic literary-narrative discourse conforms to complicated semiotic systems. As a result, semio-semantics is deemed one of the most scientific, reliable tools, since it allows intradiscursive semio-textual propositions to be analyzed phenomenologically, and even epistemologically. Consequently, the narrative discourse in “The...
Expressivity in TTS from Semantics and Pragmatics
In this paper we present ongoing work on an expressive TTS reader that can be used in both text and dialogue applications. The system, called SPARSAR, has so far been used to read (English) poetry, but it can now be applied to any text. The text is fully analyzed at the phonetic and phonological levels as well as at the syntactic and semantic levels. In addition, the system has access to a restricted...
Reasoning in Open Domains
In this paper we modify the semantics of epistemic specifications (and hence the answer set semantics of extended logic programs and disjunctive databases) to allow for reasoning in the absence of the domain-closure assumption. This modification increases the expressive power of the language and allows one to explicitly state the domain-closure and other assumptions about the domain of discourse in th...
Comparison of chironomic stylization versus statistical modeling of prosody for expressive speech synthesis
Chironomic stylization is the process of real-time modification of intonation contours (f0 and tempo) using drawing/writing gestures with a stylus on a graphic tablet. The question addressed in this research is whether hand-made intonation stylization could improve or degrade expressivity and overall quality, compared to statistical modeling of prosody. A system for expressive TTS in French bas...